Time Delay Histogram Based Speech Source Separation Using a Planar Array
نویسندگان
چکیده
Bin-wise time delay is a valuable clue to form the timefrequency (TF) mask for speech source separation on the twomicrophone array. On widely spaces microphones, however, the time delay estimation suffers from spatial aliasing. Although histogram is a simple and effective method to tackle the problem of spatial aliasing, it can not be directly applied on planar arrays. This paper proposes a histogram-based method to separate multiple speech sources on the arbitrary-size planar array, where the spatial aliasing is resisted. Time delay histogram is firstly utilized to estimate the delays of multiple sources on each microphone pair. The estimated delays on all pairs are then incorporated into an azimuth histogram by means of the pairwise combination test. From the azimuth histogram, the direction-of-arrivals (DOAs) and the number of sources are obtained. Eventually, the TF mask is determined based on the estimated DOAs. Some experiments were conducted under various conditions, confirming the superiority of the proposed method.
منابع مشابه
A consideration on time-frequency masking methods for speech separation
Time-Frequency Masking methods, primary known as DUET [2] and SAFIA [3], are effective scheme for blind speech separation problem. Based on an investigation of conventional delay-histogram and the time-frequency masking method in terms of estimated delay accuracy, two novel approaches for clustering process are proposed. In particular, the proposed methods tend to improve relatively large amoun...
متن کاملSound Source Separation of N Sources from Stereo Signals via Fitting to N Models Each Lacking One Source
We present a system to perform sound source separation of an arbitrary number of speech or music sources from a stereo signal. We build on the work of other authors’ DUET system, which uses a histogram technique to estimate the mixing parameters of the time-frequency sparse sources, before using a nearest-neighbor approach to demix the sources. Herein, we describe a new demixing method called D...
متن کاملGradient Flow Broadband Beamforming and Source Separation
We present and demonstrate a method for blind separation and bearing estimation of broadband traveling waves, impinging on a sensor array with dimensions smaller than the shortest wavelength in the sources. By sensing spatial and temporal gradients of the received signal, the problem of separating mixtures of time-delayed sources reduces to that of separating instantaneous mixtures of the gradi...
متن کاملDOA estimation of speech signals using semi-blind source separation techniques
In this paper we investigate the application of complex independent component analysis (ICA) to the direction of arrival (DOA) estimation problem of wideband signals. The ICA based technique is semi-blind in the sense that the structure of the array is known to be uniform and linear (ULA). We show that when the array is ULA the mixing matrix is forced to have the structure imposed by the direct...
متن کاملMultiple moving speaker tracking by microphone array on mobile robot
Real-world applications often require tracking multiple moving speakers for improving human-robot interactions and/or sound source separation. This paper presents multiple moving speaker tracking using an 8ch microphone array system installed on a mobile robot. This problem is difficult because the system does not assume that sound sources and/or the microphone array are fixed. Our solutions co...
متن کامل